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ABSTRACT 

We report on the small scale (0.5 < r < 40ft.~ 1 Mpc) clustering of 
78895 massive (M* ~ lO n ' 3 M ) galaxies at 0.2 < z < 0.4 from the 
first two years of data from the Baryon Oscillation Spectroscopic 
Survey (BOSS), to be released as part of SDSS Data Release 9 
(DR9). We describe the sample selection, basic properties of the 
galaxies, and caveats for working with the data. We calculate the 
real- and redshift-space two-point correlation functions of these 
galaxies, fit these measurements using Halo Occupation Distri- 
bution (HOD) modeling within dark matter cosmological simula- 
tions, and estimate the errors using mock catalogs. These galaxies 
lie in massive halos, with a mean halo mass of 5.2 x 10 13 /i _1 Mq, a 
large scale bias of ~ 2.0, and a satellite fraction of 12 ± 2%. Thus, 
these galaxies occupy halos with average masses in between those 
of the higher redshift BOSS CMASS sample and the original SDSS 
I/II LRG sample. 
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1 INTRODUCTION 

The large scale structure traced by galaxies is 
fundamentally dependent on the cosmology of 
the early Universe. Because of the growth of 
early perturbations due to gravitational attrac- 
tion, massive dark matter halos are more strongly 
clustered than less massive halos. As massive 
galaxies preferentially live in massive halos, we 
can use large surveys of massive galaxies to probe 
the evolution of dark matter halos through cos- 
mic time. Past galaxy redshift surveys such as 
the Two-Degree Field Galaxy Redshift Survey 
( Colless et atl|2003[l and Sloa n Digital Sky Sur- 



vey (SPSS; York et al. 2000| I/II galaxy sam 



pies ( |Eisenstein et al.||2001| |Strauss et al.||2002 1 
provided large catalogs to constrain the proper- 
ties of galaxies and their halos in the relatively 
local Universe. The Baryon Oscillation Spectro- 



scopic Survey (BOSS; Schlegel, White & Eisen- 
stein||2009|), part of the SDSS-IH project (|Eisen- 



stein et al.|2011 I, includes populations of galaxies 
and quasars to probe the evolution of large scale 
structure over cosmic time. 

This paper presents the first measurements 
of the clustering of the low redshift (0.2 < z < 
0.4) BOSS galaxy sample. This study includes 
observations from June 2010 through June 2011, 
and compares this sample with the high redshift 
BOSS sample (0.43 < z < 0.7) that was ana- 
lyzed by | White et al.| pOlTj ), and | Anderson et al.] 



(20121, as well as earlier SDSS I/II galaxy sam- 



pies ( 


Eisenstein et al.||2001 


Zehavi et al. 


2005a 


Wake et al.| |2008| |Padmanabhan et al. 


|2009 



Zheng et al.|2009| |Tojeiro fc Perci val 2010) from 
a similar redshift range. The galaxy redshift in- 
formation used in this analysis will be released 
as part of the Data Release 9 (DR9) public cat- 
alog. BOSS wil l use these data and the CMASS 
galaxy sample ( |Anderson et al.|2012[ ) to measure 
the Baryon Acoustic Oscillation (BAO) signature 
in the correlation function and power spectrum to 
high precision across a range of redshifts. This in- 
formation will provide precise constraints on cos- 
mology that are nearly orthogonal to those pro- 
vided by cosmic microwave background (e.g.|Beii^ 



nett et al.|2003 Komatsu et al.|2011 1 and super- 


nova studies (e.g. |Kowalski et al. 


2008 


Kessler 


et al.|2009||Sollerman et al.|2009||l 


^ampeitl et al. 



2010 



Amanullah et al. 



2010). 



SDSS I/II had two galaxy samples: the main 
sample ( Strauss et al.|200"2 I , with a mean redshift 
z ~ 0.1, intended to broadly sample all classes 
of galaxies over a wide range of luminosity and 



color, and the Luminous Red Galaxy (LRG) sam- 
ple ( |Eisenstein et al.|2001[ ), with a mean redshift 
z ~ 0.3, intended to provide a large effective vol- 
ume for large scale structure studies. The LRG 
sample provided the first clear detection of the 
BAO feature in the galaxy correlation function 
and power spectrum ( Eisenstein et al.[2 005 ) , mo- 
tivating the design of BOSS. The BOSS galaxy 
samples were selected to produce a mostly sam- 
ple variance limited measure of BAO to z = 0.7. 
BOSS thus includes a z < 0.45 sample (LOWZ) 
with higher number density than the SDSS I/II 
LRGs and a higher redshift sample (CMASS) of 
similar space density. 

The primary goal of this paper is to char- 
acterize the LOWZ BOSS sample and compare 
it with other samples of massive, red galaxies 
at similar redshifts. We begin with a description 
of the galaxy sample, including selection criteria 
and caveats in Section [2] We describe the over- 
all clustering properties in Section |3.1[ our Halo 
Occupation Distribution (HOD) model in Section 
|3.2| our error derivation in Section |3.3| and our 
technique for fitting the correlation function in 
Section |3.4| As a test of our fitting procedure, 
we compare the resulting HOD models with the 
measured redshift space clustering in Section]!] In 
Section [5] we compare the properties of this sam- 
ple with a number of previous studies. Section [6] 
summarizes our results and some technical details 
of our HOD modeling appear in an appendix. For 
this work, we quote distances as comoving sepa- 
rations in h~ 1 Mpc and convert redshifts to dis- 
tances assuming a flat ACDM cosmology, with 
Q M = 0.274. 



2 THE SAMPLE 



The S loan Digital Sky Survey (SDSS; [York et al. 
|2000[ ) mapped over one third of the sky using 



the dedicated 2.5-m Sloan Telescope (Gunn et al 



2006) located at Apache Point Observatory in 
New Mexico. A drift-scanning mosaic CCD cam- 
era ( Gunn et al.|1998 1 imaged the sky in five pho- 



tometric bandpasses (Fukugi ta et al.|1996 Smith 
|et al.|2002| |Doi et al.|2010[ ) to a 5a limiting mag- 
nitude of r ~ 22.5. The imaging data were pro- 
cessed through a series of pipelines that perform 
astrometric calibr ation (|Pier et al.||2003[ ), photo- 
metric reduction (Lupton et al. 20011, and pho- 
tometric calibration ( Padmanabhan et al.| [2008). 
The magnitudes were corrected for Galactic ex- 
tinction using the maps of Schlegel, Finkbci ner fc| 
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|Davis|([l998|. BOSS, as par t of the SDSS-III sur- 
vey (jEiscnstcin et al. 201l]), has imaged an addi- 
tional 2400 square degrees of the South Galactic 
sky in a manner identical to the original SDSS 
imaging. 



BOSS targetted two galaxy samples (Pad- 
manabhan inprepl: CMASS, at z ~ 0.5 and ini- 



tially analyzed in |White et aL] ( |2011[ ), and LOWZ, 
at z ~ 0.3 , which is the focus of this study. This 
sample was selected as an extension of the SDSS 



I/II LRG (henceforth: Legacy) sample (Eisen- 
stein et al.|2001 1, with three times its space den- 



sity. The goal of both BOSS galaxy target se- 
lection methods is to identify luminous, highly 
biased galaxies, with a galaxy number density 
N(z) ~3x 10" 4 /i 3 Mpc" 3 . These galaxies should 
represent the most strongly clustered galaxies at 
that space density and redshift range. While the 
CMASS sample was targeted to be an approx- 
imately mass-limited sample of galaxies with a 
range of colors (about 25% are blue) , LOWZ con- 
sists primarily of red galaxies. Up to ~ 30% of 
LOWZ targets were observed during the Legacy 
survey, and thus already have a redshift. This re- 
duces the number of new redshifts required, but 
slightly complicates the analysis, as the complete- 
ness must be handled differently for Legacy and 
new BOSS redshifts. The Legacy redshifts do not 
impact the uniformity of the targeting in a given 
region, as they were ignored when assigning fibers 
to targets (see the discussion about completeness 



in Section 3.1 \ 



When defining colors, we use the SDSS model 
magnitudes which were computed using either an 



exponential ( Freeman 1970 1 or a de Vaucouleurs 



(de Vaucouleurs 1948| light profile fit to the r- 



band only, and are denoted with the mod sub- 
script. Composite model magnitudes are com- 
puted using the best-fit linear combination of an 
exponential and a de Vaucouleurs light profile fit 
to each photometric band independently, and are 
denoted with the subscript cmod. Point-spread 
function (PSF) magnitudes are computed by fit- 
ting a PSF model to the galaxy, and are denoted 
with the subscript psf . 

The LOWZ galaxy target selection algorithm 
is a straightforward extension of the method of 



Eisenstein et al. (20011 to fainter magnitudes to 



increase the number density. We define two pa- 
rameters based on the ugriz model magnitudes, 

C|| = 0.7(p mod -r mod ) + 1.2(r mod -i mod -0.18)(l) 

c± = (r mod -!mod) — (flmod — »" mo d)/4.0 — 0.18.(2) 



We target galaxies that are luminous and red with 
a redshift z < 0.4 with the following cuts. 



' cmod < 13.5 + C||/0.3 

|c x | < 0.2 

16 < r < 19.6 

> 0.3 



(3) 
(4) 
(5) 
(6) 



This selection follows the color tracks of a pas- 
sively evolving stellar population (Eq.j2|, and se- 
lects an approximately absolute magnitude lim- 
ited sample (Eq.[3| with a sliding cut in color and 
luminosity. Equation[6]is the primary star/galaxy 
separation, based on the difference between a 
modeled galaxy light profile, and a PSF profile. 
Note that this is different from the LOWZ tar- 
get selection algorithm used during the first 9 
months of BOSS (roughly, data taken through 
June 2010), which was affected by a change due 
to a bug in the star-galaxy separation. The earlier 
data have a lower on-sky density and cannot be 
corrected to reflect the new targeting, as was done 
inlWhite et al.l (120111) andlAnderson et al.N2012| 



for CMASS. Because of this issue, we restrict our- 
selves to regions that were tiled for spectroscopy 
with the corrected target selection, via 

TILE ^ 10324. 

Plates outside this range were drilled with the 
earlier target selection and star/galaxy separation 
algorithm. We recommend users of this sample 
who wish to perform large scale structure analy- 
ses on the LOWZ sample apply the same cuts on 
the catalog data to remove the early data with a 
different selection. 

We define "good" redshifts as follows, where 
is the bitwise and operator. For new BOSS 
observations, we require 

• (BOSSTARGETl && 2°) > 

• SPECPRIMARY == 1 

• ZWARNINGNOQSO == 

while for SDSS Legacy observations we require 

• SPECPRIMARY == 1 

• ZWARNING == 

These parameters are part of the SDSS DR9 cat- 
alog: BOSS TARGETl is a bitmask containing 
the target selection flags, SPECPRIMARY iden- 
tifies the best spectrum among multiple observa- 
tions, and ZWARNING NOQSO and ZWARN- 
ING are bitmasks listing potential problems with 
the redshift fit, with a value of representing 
no obvious problems. We do not require that 
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Figure 1. All Panels: LOWZ (solid blue), split into SDSS I/II LRGs (Legacy, dot-dashed cyan) and new BOSS 
(dashed cyan) galaxies, and CMASS (dashed red). Vertical dotted lines show the mean values for LOWZ and 
CMASS. All values taken from |Maraston et aTT] | |2012| ) using LRG models from |Maraston et aI7| ( |2009[ l. Left panel: 
Absolute r-band rest frame magnitudes. These magnitudes are K+e-corrected to z = 0, including corrections for 
passive evolution. CMASS galaxies and LOWZ galaxies with redshifts from Legacy are more luminous, in general, 
than the new LOWZ BOSS galaxies. Center panel: Stellar mass in log 10 (M*/MQ). BOSS galaxies with Legacy 
redshifts have higher masses, on average, than those with new BOSS redshifts. Right panel: K+e-corrected rest 
frame g — r colors. CMASS contains bluer galaxies than LOWZ, while the colors of Legacy and new BOSS galaxies 
are very similar. 



the spectrum is identified as a galaxy spectrum, 
i.e., if an object passed the targeting cuts, and 
the spectrum satisfied the above requirements, a 
non-galaxy spectrum (e.g. strong emission lines, 
quasars) would be included in the sample. 

In Figure[T]we show the absolute r-band rest 
frame magnitude, stellar mass, and g — r rest 
frame color distributions of this sample, K+e- 
corrected to z = 0, using values from Ma raston] 
|et al.| ( |20T2|. The stellar masses were calculated 
assuming a Kroupa (2001) initial mass function, 
including stellar mass loss due to stellar evolu- 
tion. We also plot distributions for the CMASS 
sample and split the LOWZ sample into Legacy 
and new BOSS objects to show the difference be- 
tween SDSS I/II targets and new BOSS targets. 
LOWZ galaxies are typically half a magnitude 
fainter than CMASS galaxies, but CMASS con- 
tains more blue galaxies. LOWZ galaxies which 
were observed as part of SDSS I/II are typically 
brighter and have ~ 0.4 dex higher stellar masses, 
which is to be expected given the brighter mag- 
nitude selection imposed as part of that survey. 
LOWZ galaxies have very similar colors to Legacy 
galaxies, and are typically redder than CMASS 
galaxies. 




Figure 2. The on-sky distribution (equatorial coordi- 
nates) of our sample, with the north and south Galac- 
tic caps (NGC and SGC, respectively) labeled for ref- 
erence. The black regions show the data used in the 
current study, while the light grey areas are the re- 
gions that had to be dropped because of the change in 
target selection. The background yellow/green color 
field shows the on-sky density of LOWZ targets in 
the full BOSS target area, representing the total area 
that will be covered when the survey is completed in 
2014. 



Figure [2] displays the on-sky distribution of 
the current sample. As noted above, because of 
changes in target selection we are not able to use 
roughly the first year's worth of data. This re- 
duces the total sky coverage compared with the 
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DR9 CMASS sample of I Anderson et all ([2012 1 



by 1205 square degrees (0.367 steradians), and 
removes a large part of the area studied in | White] 
et all p0TT|. The area with good LOWZ data 



shown covers 2467 square degrees (0.7517 stera- 
dians) on the sky. 

We consider data from the North Galactic 
Cap (NGC) and South Galactic Cap (SGC) sepa- 
rately in our analysis for a number of reasons. We 
lack a dark matter simulation that resolves the 
host halos of BOSS galaxies and is large enough 
to fit the NGC and SGC in a single simulation 
box. The SGC has a ~ 8% higher target density 
than the NGC, and thus a higher galaxy number 
density, mostly due to differences in photometric 
calibration and reddening correction between the 



hemispheres. Schlafly et al. (20101 and Schlafly 



& Finkbeiner (20111 found a difference in the 
SDSS colors between the NGC and SGC, result- 
ing in a 0.015 mag offset in c\\ (eq. [lj an d [3| . The 



"ubercal" SDSS photometry) Padmanabhan et al 



2008 ) uses overlapping regions of images to cross- 
calibrate the photometric measurments. As the 
NGC and SGC are not contiguous they are less 
well cross-calibrated than they are internally cali- 
brated. This produces some of the measured color 
difference between the NGC and SGC. Schlafly & 
Finkbeiner (20111 also identified a slight system- 



atic error in the reddening correction that was 



applied before targetting. Ross et al. (20111 de 



scribe the use of the Schlafly & Finkbeiner ( 2011 1 



color offsets, determined from stellar spectra, to 
correct for the north/south asymmetry. They find 
that the NGC/SGC CMASS and LOWZ num- 
ber density differences are completely consistent 
with the level of color offset found by |Schlafly fc| 
Finkbeiner (20111, but note the inherent uncer- 



tainties in the offsets and resulting corrections. 
They also find that these, and other, systematics 
more strongly affect the CMASS sample than the 
LOWZ sample, and are most important for clus- 
tering studies on the largest scales. As we will 
show, the resulting two LOWZ galaxy popula- 
tions are not significantly different in their clus- 
tering properties (see results in Sections |3.2| [4] 
and[5f . We provide values for the NGC and SGC 
separately, and also provide minimum-variance 
weighted values for NGC+SGC (hereafter, Full) 
when appropriate. 

Figure [3] presents the galaxy number density 
of our samples as a function of redshift. The ver- 
tical lines denote the redshift range used in this 
work (0.2 < z < 0.4). We restrict to this range as 




0.25 0.30 0.35 
redshift 



Figure 3. The redshift distribution of the BOSS 
LOWZ sample. The SGC and NGC are plotted sepa- 
rately as dashed and dashed-dotted lines, respectively; 
showing the higher number density in the SGC. The 
solid black line gives the total N(z) distribution for 
SGC+NGC. Our redshift range of 0.2 < z < 0.4 is 
marked with thin vertical lines. The original survey 
galaxy density goal of N(z) = 3 X 10~ 4 h 3 Mpc~ 3 is 
shown for reference (thin horizontal line), while the 
green horizontal lines show the NGC and SGC effec- 
tive mean density over the 0.2 < z < 0.4 range (values 
given in Table [l} 



Table 1. Statistics of the 0.2 < z < 0.4 galaxy sample 



region 


L> galaxy 


N(z) (10- 4 h 3 Mp C - 3 ) 


SGC (Legacy) 


3946 


1.502 


SGC (BOSS) 


19558 


2.681 


SGC (Legacy+BOSS) 


23504 


3.167* 


NGC (Legacy) 


18332 


0.990 


NGC (BOSS) 


37059 


2.005 


NGC (Legacy+BOSS) 


55391 


2.907* 


Full (Legacy) 


22278 


1.053 


Full( BOSS) 


56617 


2.198 


Full (Legacy+BOSS) 


78895 


2.981 



The N(z) values marked with an asterisk are used in 
the MCMC fitting procedure. 

it provides a relatively uniform number density 
across the redshift interval, and to distinguish it 
from the CMASS sample studied by | White et al.| 
(20111), which was restricted to 0.4 < z < 0.7. 



The dramatic increase in the number density be- 
low z ~ 0.2 is due in part to more than just 
massive red galaxies falling into the target selec- 
tion. Additionally, Tojeiro fc Percivaj] l]2011[) and 



Tojeiro et al. (20121 found that SDSS II LRGs 



below z < 0.2 had different dynamical growth 
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than LRGs at higher redshifts. A more in-depth 
study of the uniformity and completeness of the 
full LOWZ sample is in progress. The difference 
between the NGC (dash-dotted lines) and SGC 
(dashed lines) number density is clearly visible 
in the plot. We present some basic statistics of 
our sample in Table [I] including separate val- 
ues for the SGC, NGC, and Full NGC+SGC, and 
Legacy, BOSS, and Legacy+BOSS redshift sam- 
ples. One can see the significantly larger number 
of new BOSS redshifts compared to Legacy in the 
SGC, as SDSS I/II only observed three stripes in 
that region covering about 700 square degrees, 
compared with 3100 square degrees of imaging 



available to BOSS ( Aihara et al. 2011 1. The BOSS 



redshift densities in the two hemispheres should 
not be directly compared, because of the different 
number of Legacy redshifts in the NGC vs. SGC. 



3 REAL SPACE CLUSTERING 



3.1 Measurements 



The correlation function f(r) (Peebles 19801 mea- 



sures the excess probability of finding a galaxy in 
a volume element, dV, at separation r from a ran- 
domly selected galaxy, 



dP(r) =JVo(l + £(r))dV, 



(7) 



where Na is the mean galaxy number density. We 
use the estimator of Landy & Szalay (1993]), 

DD - 2DR + RR 



RR 



(8) 



Compared to the "natural" estimator DD/RR—1, 
the Landy and Szalay estimator reduces geomet- 
rical edge effects and minimizes variance. 

We use this estimator to calculate a two- 
point galaxy correlation function of pair separa- 
tions parallel (r\\) and perpendicular (r±) to the 
line of sight, £(rii , r±). We then compute the pro- 
jected correlation function, u p (r±), which reduces 
the effect of redshift-space distortions, by inte- 
grating, 



<*> p (r±) = 2 



Z(r\\,r±)dr\\ 



(9) 



where we take r\\ tUmx = 75/i _1 Mpc as the upper 
limit of the integral. The integral is stable around 
this value, while not contributing noise from large 
r\\ bins. The relationship between the projected 
correlation function, u) p (r±) , and the real-space 



uj p (r±) 



£[(ri+y 2 ) 1/2 ]dy 



(10) 



which is used in our full-box mock simulations to 
more quickly compute w p (r±) (see Section 3.2 1 



We again integrate to j/ max = 75ft _1 Mpc as a bal- 
ance between including most of the information 
from the correlation function, and limiting noise 
from large radius bins. We tested and confirmed 
that this integral produces almost identical re- 
sults to the integral over the correlation function 
in Eq. [9] as long as £(r) is measured in small 
enough bins to allow for smooth interpolation. 

Not all galaxy targets in each region on the 
sky were assigned spectroscopic fibers. Specifi- 
cally, the completeness of the survey varies be- 
tween the different regions defined by sectors 
which consist of disjoint regions defined by the 
overlap of spectroscopic tiles. In addition, as men- 
tioned above, some of our targets come "pre- 
observed", i.e., with redshifts from SDSS I/II. We 
thus separate galaxies into two groups and calcu- 
late their completeness, fgot, on a per-sector basis 
as 



f g ot BOSS 
f g ot Lcgacy 



-WBOSS/ (-^targets — -^Legacy) (11) 
1 



correlation function is (e.g. Davis & Peebles 1983 I 



where TVboss is the number of new, not con- 
firmed as star, BOSS redshifts, A/targets is the 
number of LOWZ targets and A^Logacy is the 
number of previously acquired Legacy redshifts 
that were targeted per the algorithm described 
above. For Legacy redshifts the completeness is 
defined to be 1, as they can be considered a 
separate, fixed, sample whose redshifts are pre- 
determined. Known Legacy redshifts were ex- 
cluded during targeting, so they do not affect 
whether any BOSS targets were allocated a fiber. 
If ^targets - ^Legacy = 0, the sector is assigned 
a completeness of 1, as all of the targets have 
Legacy redshifts, and no new BOSS redshifts were 
required. 

When computing the correlati on f unction, we 
weight galaxies by fgot - (see Eq. Ill, restricted 
to only those galaxies which lie in sectors with 
fgot > 0.5. For calculating the correlation func- 
tion, we generate random points uniformly in all 
regions with fgot > 0.5, and assign all randoms 
a weight of 1. We generate 100 times the number 
of data points for our random catalog as we have 
observations, ensuring the variance of the results 
is not affected by the random catalog. We assign 
redshifts to the randoms by smoothing the NGC 
and SGC redshift distributions with a 7th order 
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Figure 4. Left panel: contours of the correlation function £(r±,rn) (smoothed with a Gaussian kernel of radius 
0.5h~ 1 yipc for clarity) of the components parallel (rii) and perpendicular (r±) to the line of sight, for BOSS 
LOWZ galaxies with redshifts in the range 0.2 < z < 0.4. In both plots the blue lines show results for NGC 
data and green lines show results for SGC data. Solid contours are shown at £ = (1,2,4,8), dashed contours at 
£ = (0.5), and a thicker solid line is shown for £ = 1. Thin black dotted circles are plotted for £(r) = (2, 1,0.5) 
for reference. Right panel: the same, but for our mock galaxy catalog (Section |3.3| . Note that similar features 
(e.g. finger-of-god) are present in the mocks. In both panels, the discontinuity at rii = is due to small number 
fluctuations in the smallest bins. 



Chebyshev polynomial and drawing from each of 
the resulting distributions separately for the NGC 
and SGC random catalogs, respectively. 

To correct for fiber-collisions, we use the 



nearest-neighbor redshift method (Zehavi et al. 
p002] |2005a[ |Berlind et H] |2006| ). To each 
galaxy within a "collision group" that does not 
have a good redshift, we assign the redshift of 
the nearest galaxy within 62". Although this 
method is known to over-correct below the fiber- 
collision radius (62" for BOSS, corresponding to 
0.235/i _1 Mpc at the highest redshift in our sam- 
ple), it is a nearly exact correction at radii larger 
than twice the fiber-collision radius. Gu o, Ze^l 
|havi fc Zheng| ( |2011| | demonstrated a more exact 
method for corrections below the fiber-collision 
radius, but their method reduces the number of 
points included in the correlation function cal- 
culation, and thus increases the variance of the 
measurement. As the smallest radius bin we con- 
sider is well above the fiber-collision radius, and 



the results of Guo, Zehavi & Zheng (20111 show 



close correspondence between their new method 
and the nearest neighbor method for large radii, 
we will adopt the simpler method in this work. 



We show the correlation function contours 
in (r\\,r±) in Figure [4] alongside the mock cata- 
log 2D correlation function for comparison. The 
mocks show similar redshift space distortions to 
the data, with Fingers-of-God and large scale 
squashing due to the Kaiser effect ([Kaiser 1987) 
clearly evident. Our procedure for generating the 
redshift space mock catalogs is described in Sec- 
tion^ Following Eq.[9j we integrate this 2D cor- 
relation function to get the projected correlation 
function, w p (r±) shown inFigure[5] The errors are 
estimated from our mock catalogs (see Section 



3.2 I, shifted onto the sky and masked with the 



coverage map. As noted in Section [2] we present 
the NGC and SGC separately in the left-hand 
figure, and the minimum- variance weighted Full 
NGC+SGC in the right-hand figure. It is clear 
from the figure that over the scales considered in 
this paper, the clustering of the NGC and SGC 
are the same within the errors, differing by less 
than la. The CMASS errors from I White et al.l 



1 2011 )on large scales are significantly larger than 



ours, because that analysis was performed on a 
much smaller data set (580deg 2 , or roughly 30% 
the solid angle). The large scale errors are deter- 
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Figure 5. Left: NGC and SGC, data and mocks. Right: NGC+SGC Full sample. In each plot, the upper panel 
shows the projected correlation function for BOSS LOWZ galaxies with redshifts in the range 0.2 < z < 0.4. 
Dashed lines show the mean (not the single best-fitting model) mock correlation functions from our MCMC 
fitting procedure. Errors are computed from the square-root of the diagonal of our covariance matrix, generated 
using the mean HOD parameters. The thin line shows a simple best-fit power law, £pl = (^~\ _7 > transformed to 
w p,pl(^±) using eq. |10| to guide the eye. Also plotted on the right is the mean CMASS correlation function from 
| White et ah] (MIT) for comparison. The lower panels show the same curves as above, but with each curve divided 
by the power law fit to emphasize the differences between the various correlation functions, and to show that a 
pure power law fit is not appropriate for this data. The NGC and SGC correlation functions match within the 
errors. The correlation functions clearly display the inflection at 1 — 2h~ 1 Mpc that marks the transition between 
the one-halo and two-halo regimes ||Zehavi et al,|2004). 



mined primarily by the volume of the survey, so 
our increased volume improves on those errors. In 



addition, the geometry of the White et al. (2011 1 



sample naturally split into three disjoint regions, 
A, B, and C, reducing the number of large-scale 
pairs. 



We provide the correlation function and er- 
ror estimates in Table [2] for the NGC, SGC and 
Full NGC+SGC samples. The values for the Full 
sample are computed from a minimum-variance 
weighted combination of the NGC and SGC. 
The bins shown are those used during the fit- 
ting procedure described below. In the same ta- 
ble, we show the covariance matrix for the Full 
NGC+SGC data set. The galaxy and random cat- 



alogs used in this analysis are available on the 
SDSS III websitsE] 



3.2 Halo Occupation Distribution 

We estimate the errors in the sample and de- 
termine the dark matter halo statistics using 20 
dark matter simulations. These are the same sim- 
ulations use d in the CMASS analysis of |White| 
et al.| ( |201l[ ), with 1500 3 particles of mass 7.6 x 
in a periodic cube 1500ft Mpc on 
a side. All 20 simulations have the same cosmo- 
logical parameters: Qm = 0.274, Q,a = 0.726, 



1 http: //www. sdss3 . org/dr9/data_access/vac .php 
under Large Scale Structure Galaxy Catalogs 
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Table 2. The projected correlation function, w p (r±), in 12 equally spaced bins in logj 5 (fx)- 



r± 
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0.577 
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27.20 


18.80 


9.51 


5.60 


4.60 


3.17 


2.30 


1.76 


1.69 


1.36 


0.85 


0.87 


NGC iu p (r 


l) 


632.95 


404.09 


242.84 


152.32 


107.88 


85.42 


60.33 


43.41 


30.70 


19.52 


12.63 


7.07 


SGC w p (r_ 


l) 


598.26 


372.80 


240.20 


156.64 


104.49 


81.68 


60.67 


42.25 


30.72 


21.11 


12.83 


7.90 


NGC (T 




31.49 


15.53 


12.40 


6.735 


4.888 


2.746 


2.292 


1.527 


1.584 


1.376 


1.042 


0.9794 


SGC a 




52.58 


36.36 


25.99 


13.78 


8.555 


5.376 


4.354 


3.322 


3.138 


2.755 


2.309 


2.070 


0.385 




1 


0.5380 


0.5110 


0.4830 


0.2790 


0.2380 


0.3510 


0.3610 


0.4470 


0.3110 


0.2710 


0.2040 


0.577 






1 


0.5150 


0.4910 


0.3680 


0.4150 


0.4730 


0.3860 


0.2860 


0.2860 


0.2470 


0.2990 


0.865 








1 


0.7090 


0.6460 


0.4640 


0.3520 


0.3910 


0.3420 


0.3250 


0.1880 


0.1430 


1.299 










1 


0.6150 


0.5100 


0.4710 


0.5010 


0.5690 


0.4310 


0.2470 


0.0740 


1.945 












1 


0.6180 


0.6120 


0.6850 


0.5140 


0.4520 


0.4120 


0.2920 


2.921 














1 


0.6570 


0.5130 


0.4550 


0.3870 


0.2620 


0.1710 


4.381 
















1 


0.7850 


0.7230 


0.6470 


0.5540 


0.3980 


6.572 


















1 


0.7550 


0.6200 


0.4710 


0.2550 


9.858 




















1 


0.8300 


0.5600 


0.2640 


14.78 






















1 


0.7980 


0.5350 


22.18 
























1 


0.8750 


33.27 


























1 



Note: All bin values are in h 1 Mpc, at the volume-weighted bin center:((r| + ^ottomV^) 1 ^ 3 ' The lower part 



of the table lists the upper triangle of the covariance matrix as dj / (aidj). 



to = -1.00, fib = 0.0457, h = 0.70, n = 0.95, 
erg = 0.8, the same as the parameters given in 
the introduction for our redshift to distance con- 
version. We identify dark matter halos using a 



friends-of-friends (FoF) algorithm (Davis et al. 



|1985| l, with a linking length of 0.168. This results 
in a minimum resolved halo mass for our redshift 
slice (z = 0.30) of lO 11 ' 88 /^ 1 ^'/©. For more details 
on these simulations, see the appendix of | White] 
|et al.| poll} . Note that we are not fitting cosmo- 
logical parameters, and our CMASS comparison 
is straightforward because the assumed cosmolo- 
gies are exactly the same. 



We do not have a single cosmological simu- 
lation with a sufficiently large volume and a high 
enough number density to embed the NGC and 
SGC in the same box, but our simulations are 
large enough if the NGC and SGC are considered 
separately. We can safely ignore correlations be- 
tween the NGC and SGC in this analysis, as the 
shortest distance between an NGC and an SGC 
galaxy is 53°-corresponding to > 600 h~ Mpc at 
the sample's minimum redshift -while this paper 
only considers separations below 40 h~ Mpc. Our 
technique of fitting each separate region into our 
simulation cubes to generate mock galaxy cata- 



logs for error estimation is discussed in Section 

331 

For an accurate interpretation of galaxy clus- 
tering, we use the halo occupation distribution 
(HOD) f ormulation (|Peacock fc Smith|2000| [SeT 
jak|2000[ [Benson et al.|2000||White, Hernquist fc 
Springcl 2001: Scoccima rro et al.|2001|[Berlind Ez 



[Weinberg 2002). The HOD gives the conditional 
probability that a halo with virialized mass Mh a i 
contains N galaxies of a particular class. In this 
paper, we distinguish central and satellite galax- 
ies, and we require that halos with a satellite must 
have a central galaxy. 

We determine the number of satellite and 
central galaxies in each halo following a halo pre- 



scription based on that of Zheng et al. ( 2005 ) with 



a central galaxy probability of 



No 



i(Mhaio) = -erfc 



ln(M cut /M h 



V2o 



(12) 



and an expected number of satellites equal to 



A r sa t(M halo ) 



/ Mhalo 



KM cut 



Mi 



(13) 



where M cu t, Mi, a, k, and a are the free- 
parameters to be fit in our model, described in 
detail in Appendix [X] Briefly, M cut is a mini- 
mum mass for halos to host our galaxies, Mi is 
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a typical mass for halos to host one satellite, a is 
the scatter between M» and Mh a i , K allows the 
threshold mass for satelites and centrals to differ, 
and a is the mass dependence of the efficiency 
of galaxy formation. This halo prescription was 
created to reproduce the observed luminosity- 
dependent clustering and number densities from 
the SDSS main galaxy (jStrauss et al.][ 2002) and 
LRG samples (Eisenstein et al. 20011. We as- 



sume that for a halo to contain satellites it must 
first contain a central, so we only assign satellites 
to halos with a central galaxy. Thus, the total 
expected number of galaxies in a halo of mass 

M halo is 

<iV g ai(M halo )> = (iVcc„(M ha , ))(l + (iV sat (M hal< {»$) 

We assign central galaxies to halos when a uni- 
form random deviate is less than the value of Eq. 
|12| for that halo. We then compute the number 
of satellites in each halo hosting a central by se- 
lecting a value from a Poisson distribution with 
A = iVsat as given by Eq. [13] 

We assign central galaxies to the halo cen- 
ter position and place satellites at the position of 
a randomly-selected dark matter particle within 
the halo. This eliminates the common assump- 



tion of spherical NFW profiles (Navarro, Frenk 



& White 1996 1 and preserves the non-spherical 



halo shapes. In a study of how halo occupa- 
tion assumptions affect galaxy clustering statis- 
tics, Zu et al. ( 2008 1 and van Daalen, Angulo & 
|White| ( |2012f both found that assuming spheri- 
cal halos could decrease the measured correlation 
function by up to ~ 10 — 20% on scales below 
r < l/i _1 Mpc, compared to the clustering mea- 
sured with the true halo shapes. They also re- 
ported a reduction in the correlation function on 
scales of a few /i~ 1 Mpc if the halos are not cor- 
rectly aligned within the overall large scale struc- 
ture. By placing galaxies at the location of dark 
matter particles within each halo, we preserve 
both the halo shape and the overall halo align- 
ment. An even better choice would be to place 
the galaxies at the locations of subhalos, but our 
simulations do not have the resolution to track 
individual subhalos. In either case, this method 
more correctly reproduces the l-halo/2-halo tran- 



sition region of the correlation function ( Cooray 
fc Sheth|2002||Zehavi et al.|2004[ ) than distribut- 
ing galaxies via spherical NFW profiles. 

See Appendix [X] for details on the physical 
motivations of these HOD parameters and exam- 
ples of how they each affect both the shape of the 
HOD and the resulting correlation function. 



3.3 Error estimates 

We perform our fitting procedure below on the 
projected measurements (e.g., Eq. [9] and Figure 
|HJ, without incorporating information about the 
galaxy peculiar velocity field. To estimate mea- 
surement errors, we must shift our mock catalogs 
from real into redshift space, place them on the 
sky, and mask them with the geometry. In addi- 
tion, we effectively double the number of available 
mock galaxy catalogs by flipping each simulation 
around one axis when generating the mock galaxy 
catalog below, to double our effective number of 
simulation boxes from 20 to 40. The flip is cho- 
sen to minimize (NGC) or eliminate (SGC) over- 
lap between the flipped and non-flipped versions 
of the box. The overlap in the NGC between the 
flipped and non-flipped boxes is less than 10%, so 
there should be minimal signal in the covariance 
matrix due to this procedure. Since the geome- 
try of our simulations is a cube, while the survey 
geometry is a much more complicated region de- 
fined by sectors on the sky (described at the end 
of Section [2j and represented graphically in Fig- 
ure [5| and a radial selection function (i.e., Figure 
|3J, we remap the periodic simulation cube into 
a rectangular parallelepiped via the method and 



code of Carlson & White (2010), and then restrict 
it to the on-sky mask (see Figure [2J. 

The remapping procedure applies a shear 
transformation to the periodic simulation cube. 
Because the simulation box is periodic, we embed 
a new sheared box into the infinite tiling of peri- 
odic boxes and take the new catalog be the points 
at their new, sheared, positions. This transforma- 
tion preserves the simulation volume and number 
density and contains each point once and only 



once (see Figure 2 in Carlson & White (20101 for 
a 2d graphical representation of this procedure). 
This remapping procedure requires that the box 
size of the simulations be large enough so that it 
preserves large scale structure-an excessively thin 
sheared box will remap too many distant points 
to be in close neighborhoods. Note that we also 
must rotate the galaxy velocities (described be- 
low) by a rotation matrix defined by the normal- 
ized basis vectors defining the new sheared box. 
This places the galaxy velocity vectors, originally 
in the coordinate system of the simulation cube, 
in the coordinate system of the remapped box. 

We then embed the survey geometry into the 
remapped simulation box on the sky. We shift the 
origin of the coordinates of the simulation box so 
that its minimum (a;, y, z) coordinate corresponds 
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to the minimum (a;, y, z) coordinate for the data 
(in the NGC, and SGC separately), effectively 
placing the mock galaxy box on the sky in the 
same location as the galaxy data, but covering a 
larger area. 

To convert our mock catalogs into redshift 
space, we must assign a peculiar velocity to each 
galaxy. Central galaxies are assigned the bulk ve- 
locity of their host halos, which should produce 
a flattening of the correlation function along the 



line of sight due to large scale motion (Kaiser 
|1987| >. For satellites, we assign the peculiar veloc- 
ity of each galaxy's associated dark matter parti- 
cle, as this includes both the bulk halo motion and 
the infall velocity. This should result in Fingers- 
of-God. As our simulation velocities are "distance 
offsets" relative to the position coordinates, we 
can convert the real space coordinates, r, into red- 
shift space coordinates, s, via 



s = r + r 



(15) 



where it is the galaxy velocity as given above 
and r is the line-of-sight unit vector from the 
observer's coordinates (0, 0, 0) to the galaxy. We 
then restrict the on-sky coordinates of the galax- 
ies to the area described in Section [2] 

To measure the correlation function on these 
redshift space mocks, we use exactly the same 
binning as was used for the data, measuring ei- 
ther £ (s) directly, or w p (r±) by integrating Eq.[9] 
The error bars shown in Figures [5] and [8] and the 
covariance matrix used in the MCMC procedure 
and listed in Tableware computed from the stan- 
dard deviation of the 20 mock galaxy correlation 
functions. 



3.4 MCMC fitting 

To determine the best-fit HOD parameters, we 
use a Markov Chain Monte Carlo (MCMC) 
method to fit w p (r±) and the overall galaxy num- 
ber density, as measured from the BOSS data. 
In particular, we use the Metropolis-Hastings 
MCMC fo rmalism ( [Metropolis et al.||1953} |Hast 
ings|19 70), with errors on w p (r±) from the mock 
catalogs and assuming a fixed 15% error on the 
galaxy density (see below for justification). This 
method starts with an initial seed set of values 
for the HOD parameters and then, 



(i) Computes a step in a direction determined 
from the covariance of the parameters (as dis- 
cussed in Appendix [AT). 



(ii) Populates the dark matter simulation with 
galaxies according to that HOD. 

(iii) Computes £(r) on the resulting mock 
galaxy catalog. 

(iv) Integrates £(r) to obtain w p (r±_) via Eq. 

EES 

(v) Computes the \ 2 between the model and 
the data. 

We accept the new HOD parameters with prob- 
ability: 



P = min(l, e 



-(x£ 



-xi td )/2s 



(16) 



We then iterate this procedure for at least 25, 000 
steps, thus filling out the most likely region of 
parameter space, which allows us to easily de- 
termine the most likely value and scatter of any 
statistic derived from the HOD parameters. For 
the first run of the MCMC procedure, we boot- 
strap with fixed 20% errors on all correlation 
function points, and then use the error estimated 
from this fit in the following run. In each subse- 
quent MCMC run, we use the errors on w p (r±) 
estimated from the previous run (see Section 3.3 
for how the error estimates are generated), and 
iterate this process until convergence, defined as 
the mean correlation function of two runs being 
the same within the la errors. The fitted parame- 
ters converge after 3-4 MCMC runs, and the best- 
fit and mean values, and estimated errors given 
in this paper, are taken from the final run. 

Although the galaxy number density (Figure 
[3| varies by > 30% across our redshift range, the 
mock-to-mock variance in galaxy number density 
is ~ 2%. The survey is not volume limited, while 
the mocks are, by construction. In our MCMC 
fits, we chose a 15% error on N(z) to allow for 
variation across the redshift range, while still re- 
stricting the MCMC code to values close to the 
mean of the survey. Through testing, we found 
that a tighter restriction, e.g. 5%, prevents the 
chains from converging on a solution. A more 
complete mock catalog generation process would 
assign luminosities and colors to the galaxies and 
observe them on the sky in the same manner that 
the data was observed, resulting in a mock catalog 
with a redshift distribution closer to that of the 
data (e.g. the method of |Skibba fc Sheth|2009 |. 

Because of variance between simulations, we 
perform our MCMC fitting on the "mean" sim- 
ulation, defined as follows. We compute the cor- 
relation function, (£(r)}, of halos with Mhaio > 
10 13 /i _1 Mq, and select the simulation with the 
smallest sum-of-squares difference from the mean 
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Figure 6. Halo Occupation Distributions (HODs) for 
LOWZ (NGC+SGC) and CMASS samples compared 
with HODs for two samples of LRGs with photomet- 
ric redshifts from SDSS I/II ("DR6: 2" with a mean 
redshift of 0.326 and "DR6: 3" with a mean redshift of 
0.376). The dashed lines shows the expected number 
of centrals, dashed-dotted lines show expected number 
of satellites, and solid lines show the total number of 
galaxies as a function of halo mass. The shaded region 
shows the ±lcr variation determined by our MCMC 
fitting procedure. 



of the 20 correlation functions. This ensures that 
our correlation function fitting procedure is not 
biased high or low due to a particular simula- 
tion box having particularly high or low inher- 
ent halo clustering. Restricting to high mass ha- 
los is a rough proxy for the halos occupied by 
our galaxies, without any of the randomness in- 
volved in assigning centrals and satellites to halos. 
An alternative to choosing the "mean" simulation 
would be to perform the fitting procedure on all 
20 simulations at once, but this is computation- 
ally prohibitive. 

We fit the correlation function in the 12 ra- 
dial bins given in Table [2] plus the average galaxy 
number densities given in Table [l] When com- 
bined with our 5 parameter model, there are 7 
degrees of freedom for the ^ 2 -test portion of the 
MCMC. Our mean HOD model has a X 2 of 7.35 
for the NGC and 5.73 for the SCO 

Our MCMC procedure produces the mean 
(calculated from all MCMC steps) and best-fit 
(lowest single \ 2 during the MCMC procedure) 
parameters shown in Table [3] The parameter k 
is poorly constrained, and although a is also not 
strongly constrained, it is below the value for the 
CMASS sample (0.98 ± 0.24). We estimate our 
errors from the mean HOD parameters, not the 
best-fit, as the single best-fit value is partly de- 



Figure 7. Probability distribution (normalized to 
have an integral of 1) of satellite fractions for NGC 
and SGC. 



termined by random variations in the mock cor- 
relation functions. Figure [6] presents the mean 
number of galaxies per halo. We also show, for 
reference, the HOD determined for the CMASS 



sample of White et al. (20111, and two HODs 



selected from Padmanabhan et al. (2009) that 



represent galaxies with photometric redshifts in 
a similar redshift and mass range to those in our 
current sample. The LOWZ sample clearly lies 



between the Padmanabhan et al. (2009) samples 



2 and 3, with a steeper cutoff for central galaxies 
than sample 2. The behavior of the HOD at low 
halo masses is driven primarily by the amplitude 
of the correlation function; the relatively large 
measured clustering amplitude (compared with, 
e.g. CMASS) agrees well with our steep cutoff in 
the average number of galaxies in low mass ha- 
los. Within the HOD framework, galaxies with a 
high bias must occupy high mass halos, resulting 
in a sharper turnoff in the central galaxy fraction. 
For a more detailed comparison with CMASS, see 
Section [5] 

Our results suggest that 12±2% of NGC and 
11 ± 2% of SGC galaxies are satellite galaxies in 
their halos instead of centrals. This is compara- 
ble with the ~ 10% satellite fraction measured 



by White et al. (20111 for the higher redshift 



CMASS sample. We show the probability distri- 
bution function (PDF) of satellite fraction in Fig- 
ure [7] There is considerable overlap between the 
NGC and SGC satellite fraction PDFs, and we 
find these results to be statistically indistinguish- 
able. 
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Table 3. The mean and standard deviation of the HOD parameters (see Eg. |12|and|13| 



parameter 


mean Full 


mean NGC 


best-fit NGC 


mean SGC 


best-fit SGC 


log 10 M cut /M e 


13.25 ±0.26 


13.17 ± 0.14 


13.16 


13.09 ±0.09 


13.11 


log lo MiM 


14.18 ±0.39 


14.06 ±0.07 


14.11 


14.05 ± 0.09 


14.07 


a 


0.70 ±0.40 


0.65 ± 0.27 


0.741 


0.53 ±0.28 


0.692 


ft 


1.04 ±0.71 


1.46 ±0.44 


0.921 


1.74 ± 0.74 


1.26 


a 


0.94 ±0.49 


1.18 ±0.18 


1.38 


1.31 ± 0.19 


1.31 



4 REDSHIFT SPACE 

As a test of our HOD fits, we compute the redshift 
space correlation function for the data, and for 
our mean HOD parameters. As our HOD fitting 
procedure does not incorporate any information 
about the velocity distribution of the galaxy sam- 
ple, this approach provides a convenient check of 
our results. 

Figure [8] shows the redshift-space correlation 
function, £(s). We also plot the same power law 
from Figure [5] and the mean redshift space mock 
catalogs for comparison. The effects of redshift 
space distortions are clear here, with a decrease in 
the correlation function amplitude at small scales, 
and an increase at larger scales. 

The mocks in this case were not fit to the 
data, but rather were computed using the HOD 
parameters that were fit to the w p (r±) measure- 
ments. The differences between the data and 
mock £(s) could be due to our requirement that 
every halo with satellites must have a central: 
the lower luminosity galaxies in our sample may 
be satellites in halos that do not have a LOWZ 
galaxy as their central. The difference may also 
reflect deviations from our assumption that the 
galaxies strictly follow the motion of individual 
dark matter particles, as opposed to the subha- 
los that they truly do occupy. Further work could 
expand on this issue by, for example, making dif- 
ferent assumptions about satellites and centrals, 
selecting subsets of LOWZ galaxies with differ- 
ent colors and luminosities, or having the galaxies 
follow proper subhalos instead of individual dark 
matter particles (see, e.g., |Seljak 



2001; 



White 



200T| |Scoccimarro|[2004[ |Skibba et al.|2006| [Tin- 
ker, Weinberg fc Zheng||2006| ). 



COMPARISON WITH PREVIOUS 
WORK 



Figure [9] shows a different view on the halo 
occupation of these galaxies: the probability that 
a galaxy lies in a halo of mass M. We compute 
the mean halo mass to be 5.2 x lO 13 /i _1 M0 for 
the LOWZ sample. This Figure clearly shows the 
sharper halo occupation cutoff at low halo masses, 
compared with the CMASS sample, which had a 
mean halo mass roughly half as large. This dif- 
ference is likely due to the fact that our galax- 
ies are redder than CMASS (Figure [I]), and thus 
would tend to occupy higher mass halos (see, 
e.g., Skibba et al. 2009). In addition, a galaxy 
population undergoing dynamical passive evolu- 
tion would occupy higher mass halos with cos- 
mic time, as their original host halos merge to 
form more massive halos. This is consistent with 
the fact that there is little difference between the 
LOWZ and CMASS stellar masses (Figure [l]): 
these galaxies have undergone considerable halo 
growth, but little to no stellar mass growth since 
z ~ .5 (see also|White et al.||2007"l |Brown et "aT 



[2008] [Wake et al.|2008| 

Based on our halo fitting, we can estimate 
the bias of this galaxy population with respect to 
the underlying dark matter distribution, 



6(r) 



where £ ga i(r) is the correlation function of the 
mean galaxy mock (see Table [3b , and £dm(?") is 
the correlation function of dark matter in our sim- 
ulations. We find a large scale galaxy bias of ~ 2.0 
and show the scale dependence of the bias in Fig- 
ure [lO] The dot-dashed blue line shows the bias 
relative to the linear theory correlation function 
from CAMB (|Lewis, Challinor fc Lasenby||2000[ 



Howlett et al.|2012| |, using the same cosmological 
parameters as our simulations. The linear bias dif- 
fers from the non-linear bias on small scales but 
is similar on large scales, as expected. We find 
a similar bias to that reported for the CMASS 



sample of White et al. (2011 1 (dashed red line) 
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Figure 8. Left: NGC and SGC, data and mocks. Right: NGC+SGC Full sample. Upper panels: the redshift space 
correlation function for BOSS LOWZ galaxies with redshifts in the range 0.2 < z < 0.4. The thin line shows the 
same best-fit power law from Figure [5] with so = 8.9, 7 = 1.9. The dashed lines show the mean of the 20 mock 
catalogs generated from our simulations, using the mean HOD parameters given in Table [3] Lower panels: the 
same curves as above, but divided by the mean of the NGC and SGC to emphasize the differences between the 
various correlation functions. The thin solid line at £(s)/mean(NGC, SGC) = 1 is not the powerlaw from the 
upper panel, but is just shown for reference. 
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Figure 9. The probability that a galaxy lies in a halo 
of a given mass, comparing CMASS and the NGC and 
SGC LOWZ sample. Because the HOD for LOWZ 
galaxies is more sharply truncated than the CMASS 
HOD, LOWZ galaxies do not probe halos with masses 
as small as the those of the CMASS sample. 



The measured bias of the full CMASS and 
LOWZ samples is inconsistent with pure dynam- 
ical passive evolution of CMASS into LOWZ. In 
a dynamical passive evolution model in our cos- 
mology, the bias of a galaxy population evolves 
like 



b(z -> z) = (b Z[) - 1) 



D(z ) 
D{z) 



+ 1 



(17) 



(Fry 1996). Thus, a bias of 2 at z = 0.55 should 
evolve to 1.88 at z = 0.3. Equivalently, the 
z — 0.55 progenitors of a galaxy population with 
b(z = 0.3) = 2 shoul d have b(z = 0.55) = 2.13. 
|Tojeiro et al.| ( |2012[ ) suggest that the progeni- 
tors of SDSS II LRGs are preferentially located in 
the redder parts of the color/luminosity selection 
space. Together, this suggests that the LOWZ 
sample could have passively evolved from a red- 
der, slightly more biased subset of the CMASS 
sample. Further work fitting HODs at high red- 
shift and evolving them to low redshift (e.g. 
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Figure 10. The scale dependence of the galaxy bias, 
b = y/£gai/£DM\ f° r the LOWZ sample. The large- 
scale bias asymptotes to ~ 2.0. The strong increase 
toward scales below lh~ x Mpc appears because of the 
strong clustering of galaxies within halos, while the 
bump at the few /i _1 Mpc scale is due to one-halo/two- 
halo transition. The dashed red line shows the galaxy 
bias of the CMASS sample of |White et~aT| $20U\ , 
while the dot-dashed blue line shows the LOWZ 
galaxy bias relative to the linear theory 5dm com- 
puted with CAMB. 



Zheng, Coil fc Zehavi||2007| | White et al1[2007| 
Wake et al.||2008[ ), as well as selecting subsets of 
the CMASS sample that may be more represen- 
tative of the LOWZ progenitors could clarify this 



Zehavi et ai] ( |2005b[ ) and |Zheng, Coil fc Ze- 



havi| (2007) studied 200, 000 galaxies from the 
SDSS I/II main sample in the redshift range 
0.01 < z < 0.23, with the latter using the same 
correlation function measurements but a more 
complicated form for the HOD. Their samples 
were all less luminous than the LOWZ sample, 
with the most luminous sample having —23 < 
M r < -22 and 0.1 < z < 0.23. They found 
a large-scale bias factor of 1.91 for the most 
luminous sample, with the other samples hav- 
ing smaller bias. They also measured the mean 
halo mass to be higher for fainter red galaxies 
~ 2 x 10 14 /i _1 A/ Q than for those of intermedi- 
ate luminosities ~ 1 x 10 14 /i _1 A/q. Both of these 
mean halo masses are significantly higher than 
the mean halo mass we find for LOWZ galax- 
ies. Zheng, Coil & Zehavi (20071 also added clus- 



tering measurements from 30, 000 DEEP2 galax- 
ies in the redshift range 0.7 < z < 1.45. The 
bias of this sample ranged from 1.22 to 1.45 for 
the lower (Mb < —19.0) and higher luminos- 
ity (Mb < —20.5) samples, respectively. They 



found satellite fractions for the luminous SDSS 
and DEEP2 samples (both ~ 10%) similar to 
our LOWZ results. The results o f |Zheng, Coil 
& Zehavi (20071 were updated in Zehavi et al. 
( |2011| ), incorporating the completed SDSS I/II 
main sample galaxy catalog. Their two most lu- 
minous samples, with M™ ax > -22.0 (z < 0.245, 
N = 0.5 x 10- 4 /i 3 Mpc- 3 ) and A/ r max > -21.5 
(z < 0.199, N = 2.8 x 10~ 4 /i 3 Mpc~ 3 ), were the 
most similar to LOWZ, having b — 2.16 ± 0.05, 
/sat = 4 % ± 1% and b = 1.67 ± 0.03, / sat = 
9% ± 1%, respectively. 



|Zheng et al.| ( |2009[) used the co rrelation func 



tion measurements of Zehavi et al. ( 2005a I to ex- 



plore the host halos of 35, 000 luminous red galax- 
ies from the SDSS, with two samples covering 
0.16 < z < 0.36 and -23.2 < M 9 , z=0 . 3 < -21.2, 
and 0.16 < z < 0.44 and -23.2 < M 9 , z=0 . 3 < 
-21.8. Their M r , z=0 .3 < -21.8 sample has a 
higher mean halo mass (~ 10 14 /i _1 M©) com- 
pared to LOWZ, while their M r , z=0 .3 < -21.2 
has a very similar mean halo mass (~ 4.5 x 
10 13 /i~ 1 AfQ). Both of these samples have lower 
satellite fractions (~ 6% and ~ 3%, respec- 
tively) than our LOWZ sample. A major dif- 
ference between these samples and the LOWZ 
sample is their 3 times lower number density of 
~ 10~ 4 /i. 3 Mpc -3 , which would impact both the 
satellite fraction and halo masses. 



Wake et aL] ( |2008[ ) fit HOD parameters to 
color and luminosity matched SDSS and 2SLAQ 
(Cannon et al.||2006"| ) LRGs. They created four 
matched samples by selecting SDSS galaxies with 
the 2SLAQ color and magnitude cuts, and 2SLAQ 
galaxies with the SDSS color and magnitude cuts, 
resulting in two samples at low redshift (z ~ 0.2) 
and two at high redshift (z ~ 0.55). They found 
that the low redshift samples had higher mean 
halo masses than LOWZ (9.52 x 1O 13 /i _1 M and 
7.62 x 10 13 /i _1 A/ Q ), but similar satellite fractions 
of around 10%. On the other hand, their high red- 
shift samples had lower satellite fractions (4.7% 
and 6.2%), but more similar mean halo masses 
(6.24 x 10 13 /i _1 Af Q and 4.76 x 10 13 /i _1 M Q ). All 
of their samples had between 1.5 — 4 times lower 
number density than the LOWZ sample, ranging 
from 0.73 - 1.65 x 10" 4 /i 3 /Mpc~ 3 . 

Figure [TT] compares our measurements of 
HOD parameters M cut and Afi versus galaxy 
number density, N z , with other studies from 
the literature. The HOD parameter estimates 
shown in this plot are taken from studies cov- 
ering a range of redshifts, sample selection, spec- 
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Figure 11. Maui and Mi vs. N z for a number of 
different galaxy correlation function studies. Error- 
bars, not shown for clarity, are typically ~ 0.1 dex. 
The labels refer to the following studies; SDSS LRG 
(2PCF): |Zheng et al.| p009) , SDSS LRG (Lensing): 
|Mandelbaum et al.| (|2006|)' SDSS LR G (Photo-z, 
BCL): |Blake, Collister fc Lahav| p)08) , SD SS LRG 
(Photo^2 TPW):|Padmanabhan et al.| ( |2009}, Combo- 
17 (Re d) : |Phleps et al.| j2006|l, NDWFS : iBrown et al.| 
p008) , 2SLA Q (Wake++):|Wake et al.| ( [2008| >, SPSS 
LRG (3PCF ):|Kulkarni et al.M2007[ , BOSS CMASS 
(White++): |White et al.| < |201lj . The data in this plot 
is provided in the Appendix in Table [Al] 



troscopic and photometric catalogs, measurement 
techniques, HOD fitting methods, and survey vol- 
umes. Our results fit with the general trend of 
M cu t and Mi decreasing for higher N z . The com- 
pilation of different datasets was use d in|Brown 



et al. (20081, White et al. (20111 and Nuza et al 



( 2012 1. This result is consistent with the observa- 
tion that galaxy number density is a rough proxy 
for the survey's luminosity or stellar mass limit 
(e.g., |Tinker et al.|[2005j |Zheng et ah1|2005| |Ze~ 



havi et al.||2011 1. For future reference, this data 



is also compiled in Table [AT] 



6 CONCLUSIONS 

We have described the clustering properties of 
~ 80, 000 BOSS LOWZ galaxies from the SDSS 
DR9 sample. Our measurements, fitting proce- 
dure, and mock catalogs provide the following 
properties of this sample: 

• When working with the LOWZ catalog, we 
recommend that approximately the first year's 
data should not be included in uniform samples 
because it was acquired with a shallower selection 
function. We give a description of how to restrict 



the sample to the correct targets. We incorpo- 
rate data from SDSS Legacy to provide about one 
third of our redshifts, and provide a description of 
how to include these data when generating large 
scale structure catalogs. 

• The clustering of the NGC and SGC sam- 
ples differ by less than la, even though the south 
has a ~ 10% higher number density because of 
variations in photometry. We present correlation 
function and fitted parameter values for the Full 
NGC+SGC sample and for both the NGC and 
SGC separately. 

• The LOWZ sample has a higher correlation 
function amplitude (ro ~ 8.9), but a similar bias 
(b ~ 2.0) compared to the higher redshift CMASS 
sample. 

• The best-fit and mean HOD both result in a 
higher average host halo mass (5.2 x 10 13 /i _1 Mq) 
and steeper mass cutoff for the LOWZ sample 
compared to the CMASS sample, but well within 
the range of similar photometric galaxy samples 
from SDSS I/H. 

• Our HOD fits result in a satellite fraction of 
~ 11% for the LOWZ sample. 

• The LOWZ sample is broadly consistent 
with being passively evolved analogs of a subset 
of the higher redshift CMASS sample. 
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APPENDIX A: HOD PARAMETERS 

The central galaxy occupation function in Eq. [12] 
allows for a log-normal scatter between galaxy lu- 
minosity and halo mass with logarithmic disper- 
sion a, so that the occupation of central galaxies 
rises smoothly from zero to one. The satellite oc- 
cupation function of Eq.[l3]is a power-law of slope 
a smoothly truncated at low halo masses, normal- 
ized by the mass Mi at which halos have approxi- 
mately one satellite on average. The parameter k 
allows different cutoffs in the central and satellite 
occupations, and it ensures that the normaliza- 
tion of the satellite occupation is not driven arti- 
ficially by the behaviour of low-occupation halos. 
Occupation functions of this form provide a good 
fit to the theoretically predicted occupations of 
galaxy samples limited by stellar mass or lumi- 
nosity ( |Kravtsov et al.||2004| |Zheng et al~1|2005 k 
Our choice of HOD represents a step function 
with a lower mass threshold for the number of 



centrals (Eq. 12 1, smoothed out and increasing 



slowly to a probability of 1 over some range (the 
complementary error function). Similarly, as we 
expect the number of satellites (Eq. 13 1 to in- 
crease at high halo mass, we assume a power law 
for the number of satellites, with a minimum mass 
cutoff («M cu t) to prevent small halos from hav- 
ing satellites. Because of the color selection and 
the modest dependence of luminosity threshold 
on redshift in the BOSS LOWZ sample, we may 
expect the dispersion a to be somewhat larger 
than it would be for a sharply thresholded sam- 



et al. 



(20051; Zheng, Coil & Zehavi (20071; Wake et al. 



pie. See, e.g. Berlind & Weinberg (2002); Hamana 



(20041; Zehavi et al. (2005a I; Zheng et al. 



12008) for other choices of HOD parameteriza- 
tion, some of which are analogous to ours. 

As a rough guideline, the parameters affect 
the resulting HOD, and thus the correlation func- 
tion in the following ways. M cut determines the 



characteristic minimum mass of halos in which 
a galaxy is allowed, and thus affects the over- 
all amplitude of the correlation function, with 
smaller values of M cu t resulting in a lower am- 
plitude. The quantity a changes the shape of the 
"central galaxy probability function", with smaller 
a resulting in fewer galaxies populating lower 
mass halos with lower large-scale bias, and thus 
a higher correlation function amplitude on large 
scales. Mi affects the number of satellites, with 
smaller Mi producing more satellites, and thus 
a higher amplitude for the correlation function 
overall, and a steeper slope at small radii. Increas- 
ing a increases the number of satellites, adding 
power on small scales, and affecting the one- 
halo/two-halo transition. Varying k has a small 
effect, tuning the mass at which satellites are al- 
lowed and slightly altering the small-scale correla- 
tion function. These parameters have some degen- 
eracy, with a and M cu t, and a and Mi positively 
correlated, and Mi and M cut negatively corre- 
lated; we find little dependence on « within the 
range we sample, < n < 3. Other studies that 
explore the relation of the HOD parameters with 



the resulting correlation function includ e |Berlind 
fc Weinberg] ([2002| ; |Zheng et al.| p009| ; | Watson 
Berlind fc Zentner ( |2011[ ). 

Figure |A1| shows how the correlation func- 
tion, w p (r p ), and the satellite and central halo 
functions vary as each individual HOD parameter 
is changed over a range of reasonable values. Each 
panel includes one set of curves with the values 
Mi = 13.00, M cut = 14.00, a = 1.00, k = 1.00, 
a = 1.00 as a fiducial. The degeneracies in the 
correlation functions between the parameters are 
visible, as well as the effects that each has on the 
shape and amplitude of the resulting correlation 
function. 

The values of M cu t and Mi from previous 
studies used in Figure [TT| are given in Table [AT] 
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